Overview

Dataset statistics

Number of variables13
Number of observations569
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory57.9 KiB
Average record size in memory104.2 B

Variable types

NUM13

Reproduction

Analysis started2020-06-04 21:40:19.573383
Analysis finished2020-06-04 21:40:57.625324
Duration38.05 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

perimeter_mean is highly correlated with radius_mean and 1 other fieldsHigh correlation
radius_mean is highly correlated with perimeter_mean and 1 other fieldsHigh correlation
area_mean is highly correlated with radius_mean and 1 other fieldsHigh correlation
concave points_mean is highly correlated with concavity_meanHigh correlation
concavity_mean is highly correlated with concave points_meanHigh correlation
perimeter_se is highly correlated with radius_seHigh correlation
radius_se is highly correlated with perimeter_seHigh correlation
concavity_mean has 13 (2.3%) zeros Zeros
concave points_mean has 13 (2.3%) zeros Zeros

Variables

radius_mean
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count456
Unique (%)80.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean14.127291739894552
Minimum6.981
Maximum28.11
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum6.981
5-th percentile9.5292
Q111.7
median13.37
Q315.78
95-th percentile20.576
Maximum28.11
Range21.129
Interquartile range (IQR)4.08

Descriptive statistics

Standard deviation3.524048826
Coefficient of variation (CV)0.2494497099
Kurtosis0.8455216229
Mean14.12729174
Median Absolute Deviation (MAD)1.9
Skewness0.9423795717
Sum8038.429
Variance12.41892013
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
12.3440.7%
 
12.7730.5%
 
15.4630.5%
 
12.8930.5%
 
13.0530.5%
 
11.7130.5%
 
13.8530.5%
 
11.8930.5%
 
10.2630.5%
 
12.1830.5%
 
11.0630.5%
 
12.4630.5%
 
13.1730.5%
 
1330.5%
 
11.630.5%
 
12.0520.4%
 
19.420.4%
 
14.9520.4%
 
12.3620.4%
 
12.5420.4%
 
13.7720.4%
 
14.8720.4%
 
13.8720.4%
 
13.2820.4%
 
15.7820.4%
 
Other values (431)50388.4%
 
ValueCountFrequency (%) 
6.98110.2%
 
7.69110.2%
 
7.72910.2%
 
7.7610.2%
 
8.19610.2%
 
8.21910.2%
 
8.57110.2%
 
8.59710.2%
 
8.59810.2%
 
8.61810.2%
 
ValueCountFrequency (%) 
28.1110.2%
 
27.4210.2%
 
27.2210.2%
 
25.7310.2%
 
25.2210.2%
 
24.6310.2%
 
24.2510.2%
 
23.5110.2%
 
23.2910.2%
 
23.2710.2%
 

texture_mean
Real number (ℝ≥0)

Distinct count479
Unique (%)84.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean19.289648506151142
Minimum9.71
Maximum39.28
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum9.71
5-th percentile13.088
Q116.17
median18.84
Q321.8
95-th percentile27.15
Maximum39.28
Range29.57
Interquartile range (IQR)5.63

Descriptive statistics

Standard deviation4.301035768
Coefficient of variation (CV)0.2229711841
Kurtosis0.7583189724
Mean19.28964851
Median Absolute Deviation (MAD)2.81
Skewness0.6504495421
Sum10975.81
Variance18.49890868
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
14.9330.5%
 
15.730.5%
 
18.930.5%
 
16.8430.5%
 
17.4630.5%
 
18.2230.5%
 
20.5230.5%
 
16.8530.5%
 
19.8330.5%
 
18.8920.4%
 
13.9820.4%
 
20.2220.4%
 
18.1820.4%
 
18.6120.4%
 
21.2520.4%
 
27.1520.4%
 
21.8420.4%
 
14.9620.4%
 
16.5820.4%
 
20.7620.4%
 
21.5320.4%
 
15.5120.4%
 
21.4620.4%
 
21.5920.4%
 
13.920.4%
 
Other values (454)51089.6%
 
ValueCountFrequency (%) 
9.7110.2%
 
10.3810.2%
 
10.7210.2%
 
10.8210.2%
 
10.8910.2%
 
10.9110.2%
 
10.9410.2%
 
11.2810.2%
 
11.7910.2%
 
11.8910.2%
 
ValueCountFrequency (%) 
39.2810.2%
 
33.8110.2%
 
33.5610.2%
 
32.4710.2%
 
31.1210.2%
 
30.7210.2%
 
30.6210.2%
 
29.9710.2%
 
29.8110.2%
 
29.4310.2%
 

perimeter_mean
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count522
Unique (%)91.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.96903339191564
Minimum43.79
Maximum188.5
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum43.79
5-th percentile60.496
Q175.17
median86.24
Q3104.1
95-th percentile135.82
Maximum188.5
Range144.71
Interquartile range (IQR)28.93

Descriptive statistics

Standard deviation24.29898104
Coefficient of variation (CV)0.2642082899
Kurtosis0.9722135477
Mean91.96903339
Median Absolute Deviation (MAD)12.71
Skewness0.9906504254
Sum52330.38
Variance590.4404795
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
82.6130.5%
 
134.730.5%
 
87.7630.5%
 
13020.4%
 
58.7920.4%
 
133.820.4%
 
85.9820.4%
 
113.420.4%
 
81.3520.4%
 
84.0820.4%
 
73.3420.4%
 
107.120.4%
 
130.720.4%
 
88.3720.4%
 
117.420.4%
 
88.7320.4%
 
78.8320.4%
 
78.2920.4%
 
79.1920.4%
 
102.420.4%
 
132.420.4%
 
71.4920.4%
 
82.6920.4%
 
132.920.4%
 
87.2120.4%
 
Other values (497)51690.7%
 
ValueCountFrequency (%) 
43.7910.2%
 
47.9210.2%
 
47.9810.2%
 
48.3410.2%
 
51.7110.2%
 
53.2710.2%
 
54.0910.2%
 
54.3410.2%
 
54.4210.2%
 
54.5310.2%
 
ValueCountFrequency (%) 
188.510.2%
 
186.910.2%
 
182.110.2%
 
174.210.2%
 
171.510.2%
 
166.210.2%
 
165.510.2%
 
158.910.2%
 
155.110.2%
 
153.510.2%
 

area_mean
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count539
Unique (%)94.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean654.8891036906855
Minimum143.5
Maximum2501.0
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum143.5
5-th percentile275.78
Q1420.3
median551.1
Q3782.7
95-th percentile1309.8
Maximum2501
Range2357.5
Interquartile range (IQR)362.4

Descriptive statistics

Standard deviation351.9141292
Coefficient of variation (CV)0.5373644594
Kurtosis3.652302762
Mean654.8891037
Median Absolute Deviation (MAD)153.3
Skewness1.645732176
Sum372631.9
Variance123843.5543
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
512.230.5%
 
121420.4%
 
399.820.4%
 
758.620.4%
 
107520.4%
 
372.720.4%
 
684.520.4%
 
716.620.4%
 
113820.4%
 
658.820.4%
 
559.220.4%
 
506.320.4%
 
321.620.4%
 
334.220.4%
 
537.320.4%
 
477.320.4%
 
361.620.4%
 
575.320.4%
 
514.320.4%
 
641.220.4%
 
52020.4%
 
107620.4%
 
466.120.4%
 
126420.4%
 
43220.4%
 
Other values (514)51891.0%
 
ValueCountFrequency (%) 
143.510.2%
 
170.410.2%
 
178.810.2%
 
18110.2%
 
201.910.2%
 
203.910.2%
 
221.210.2%
 
221.310.2%
 
221.810.2%
 
224.510.2%
 
ValueCountFrequency (%) 
250110.2%
 
249910.2%
 
225010.2%
 
201010.2%
 
187810.2%
 
184110.2%
 
176110.2%
 
174710.2%
 
168610.2%
 
168510.2%
 

smoothness_mean
Real number (ℝ≥0)

Distinct count474
Unique (%)83.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09636028119507908
Minimum0.052629999999999996
Maximum0.1634
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.05263
5-th percentile0.075042
Q10.08637
median0.09587
Q30.1053
95-th percentile0.11878
Maximum0.1634
Range0.11077
Interquartile range (IQR)0.01893

Descriptive statistics

Standard deviation0.01406412814
Coefficient of variation (CV)0.1459535813
Kurtosis0.8559749304
Mean0.0963602812
Median Absolute Deviation (MAD)0.0095
Skewness0.4563237648
Sum54.829
Variance0.0001977997003
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.100750.9%
 
0.107540.7%
 
0.105440.7%
 
0.11540.7%
 
0.108930.5%
 
0.103730.5%
 
0.0946230.5%
 
0.104930.5%
 
0.0851130.5%
 
0.106630.5%
 
0.114130.5%
 
0.109630.5%
 
0.102430.5%
 
0.109930.5%
 
0.115830.5%
 
0.11730.5%
 
0.106330.5%
 
0.108230.5%
 
0.104430.5%
 
0.0983130.5%
 
0.0877220.4%
 
0.100320.4%
 
0.11220.4%
 
0.103120.4%
 
0.0908720.4%
 
Other values (449)49486.8%
 
ValueCountFrequency (%) 
0.0526310.2%
 
0.0625110.2%
 
0.0642910.2%
 
0.0657610.2%
 
0.0661310.2%
 
0.0682810.2%
 
0.0688310.2%
 
0.0693510.2%
 
0.069510.2%
 
0.0695510.2%
 
ValueCountFrequency (%) 
0.163410.2%
 
0.144710.2%
 
0.142510.2%
 
0.139810.2%
 
0.137110.2%
 
0.133510.2%
 
0.132610.2%
 
0.132310.2%
 
0.129110.2%
 
0.128610.2%
 

compactness_mean
Real number (ℝ≥0)

Distinct count537
Unique (%)94.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.10434098418277679
Minimum0.01938
Maximum0.3454
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.01938
5-th percentile0.04066
Q10.06492
median0.09263
Q30.1304
95-th percentile0.2087
Maximum0.3454
Range0.32602
Interquartile range (IQR)0.06548

Descriptive statistics

Standard deviation0.05281275793
Coefficient of variation (CV)0.5061554512
Kurtosis1.650130467
Mean0.1043409842
Median Absolute Deviation (MAD)0.03263
Skewness1.190123031
Sum59.37002
Variance0.0027891874
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.120630.5%
 
0.114730.5%
 
0.0499420.4%
 
0.133920.4%
 
0.126720.4%
 
0.208720.4%
 
0.0772220.4%
 
0.0769820.4%
 
0.0383420.4%
 
0.159920.4%
 
0.130520.4%
 
0.1720.4%
 
0.115420.4%
 
0.128920.4%
 
0.148320.4%
 
0.0950920.4%
 
0.131320.4%
 
0.151620.4%
 
0.114120.4%
 
0.102120.4%
 
0.0574320.4%
 
0.122320.4%
 
0.111720.4%
 
0.130620.4%
 
0.130420.4%
 
Other values (512)51790.9%
 
ValueCountFrequency (%) 
0.0193810.2%
 
0.0234410.2%
 
0.026510.2%
 
0.0267510.2%
 
0.0311610.2%
 
0.0321210.2%
 
0.0339310.2%
 
0.0339810.2%
 
0.0345410.2%
 
0.0351510.2%
 
ValueCountFrequency (%) 
0.345410.2%
 
0.311410.2%
 
0.286710.2%
 
0.283910.2%
 
0.283210.2%
 
0.277610.2%
 
0.27710.2%
 
0.276810.2%
 
0.266510.2%
 
0.257610.2%
 

concavity_mean
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count537
Unique (%)94.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0887993158172232
Minimum0.0
Maximum0.4268
Zeros13
Zeros (%)2.3%
Memory size4.6 KiB

Quantile statistics

Minimum0
5-th percentile0.0049826
Q10.02956
median0.06154
Q30.1307
95-th percentile0.24302
Maximum0.4268
Range0.4268
Interquartile range (IQR)0.10114

Descriptive statistics

Standard deviation0.07971980871
Coefficient of variation (CV)0.8977525105
Kurtosis1.998637529
Mean0.08879931582
Median Absolute Deviation (MAD)0.04046
Skewness1.401179739
Sum50.5268107
Variance0.0063552479
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0132.3%
 
0.120430.5%
 
0.0134220.4%
 
0.0334420.4%
 
0.0268820.4%
 
0.0199720.4%
 
0.0197220.4%
 
0.197420.4%
 
0.0672620.4%
 
0.108520.4%
 
0.241720.4%
 
0.0842220.4%
 
0.244820.4%
 
0.0299520.4%
 
0.111520.4%
 
0.213320.4%
 
0.0589220.4%
 
0.110320.4%
 
0.10120.4%
 
0.100720.4%
 
0.291410.2%
 
0.0239810.2%
 
0.0194710.2%
 
0.0271210.2%
 
0.0682410.2%
 
Other values (512)51290.0%
 
ValueCountFrequency (%) 
0132.3%
 
0.00069210.2%
 
0.000973710.2%
 
0.00119410.2%
 
0.00146110.2%
 
0.00148710.2%
 
0.00154610.2%
 
0.00159510.2%
 
0.00159710.2%
 
0.0018610.2%
 
ValueCountFrequency (%) 
0.426810.2%
 
0.426410.2%
 
0.410810.2%
 
0.375410.2%
 
0.363510.2%
 
0.352310.2%
 
0.351410.2%
 
0.336810.2%
 
0.333910.2%
 
0.320110.2%
 

concave points_mean
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count542
Unique (%)95.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.04891914586994728
Minimum0.0
Maximum0.2012
Zeros13
Zeros (%)2.3%
Memory size4.6 KiB

Quantile statistics

Minimum0
5-th percentile0.0056208
Q10.02031
median0.0335
Q30.074
95-th percentile0.12574
Maximum0.2012
Range0.2012
Interquartile range (IQR)0.05369

Descriptive statistics

Standard deviation0.03880284486
Coefficient of variation (CV)0.7932036459
Kurtosis1.066555703
Mean0.04891914587
Median Absolute Deviation (MAD)0.02014
Skewness1.171180081
Sum27.834994
Variance0.001505660769
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0132.3%
 
0.0286430.5%
 
0.124220.4%
 
0.0525220.4%
 
0.0577820.4%
 
0.0203120.4%
 
0.0192420.4%
 
0.0161520.4%
 
0.0259420.4%
 
0.147120.4%
 
0.104320.4%
 
0.0227220.4%
 
0.0237720.4%
 
0.0236920.4%
 
0.0646220.4%
 
0.0155310.2%
 
0.0258310.2%
 
0.0601810.2%
 
0.059810.2%
 
0.032510.2%
 
0.0573610.2%
 
0.126510.2%
 
0.0245610.2%
 
0.00882910.2%
 
0.00850710.2%
 
Other values (517)51790.9%
 
ValueCountFrequency (%) 
0132.3%
 
0.00185210.2%
 
0.00240410.2%
 
0.00292410.2%
 
0.00294110.2%
 
0.00312510.2%
 
0.00326110.2%
 
0.00333310.2%
 
0.00347210.2%
 
0.00416710.2%
 
ValueCountFrequency (%) 
0.201210.2%
 
0.191310.2%
 
0.187810.2%
 
0.184510.2%
 
0.182310.2%
 
0.168910.2%
 
0.16210.2%
 
0.160410.2%
 
0.159510.2%
 
0.156210.2%
 

symmetry_mean
Real number (ℝ≥0)

Distinct count432
Unique (%)75.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.18116186291739894
Minimum0.106
Maximum0.304
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.106
5-th percentile0.1415
Q10.1619
median0.1792
Q30.1957
95-th percentile0.23072
Maximum0.304
Range0.198
Interquartile range (IQR)0.0338

Descriptive statistics

Standard deviation0.02741428134
Coefficient of variation (CV)0.1513247926
Kurtosis1.287932992
Mean0.1811618629
Median Absolute Deviation (MAD)0.0171
Skewness0.7256089734
Sum103.0811
Variance0.0007515428212
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.171440.7%
 
0.176940.7%
 
0.189340.7%
 
0.171740.7%
 
0.160140.7%
 
0.211630.5%
 
0.19330.5%
 
0.161930.5%
 
0.186130.5%
 
0.17230.5%
 
0.195330.5%
 
0.146730.5%
 
0.180930.5%
 
0.148730.5%
 
0.192530.5%
 
0.163830.5%
 
0.173930.5%
 
0.196630.5%
 
0.151630.5%
 
0.150630.5%
 
0.177930.5%
 
0.166930.5%
 
0.188530.5%
 
0.15930.5%
 
0.166730.5%
 
Other values (407)48985.9%
 
ValueCountFrequency (%) 
0.10610.2%
 
0.116710.2%
 
0.120310.2%
 
0.121510.2%
 
0.12210.2%
 
0.127410.2%
 
0.130510.2%
 
0.130810.2%
 
0.133710.2%
 
0.133910.2%
 
ValueCountFrequency (%) 
0.30410.2%
 
0.290610.2%
 
0.274310.2%
 
0.267810.2%
 
0.265510.2%
 
0.259710.2%
 
0.259510.2%
 
0.256910.2%
 
0.255610.2%
 
0.254810.2%
 

fractal_dimension_mean
Real number (ℝ≥0)

Distinct count499
Unique (%)87.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.06279760984182776
Minimum0.049960000000000004
Maximum0.09744
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.04996
5-th percentile0.053926
Q10.0577
median0.06154
Q30.06612
95-th percentile0.07609
Maximum0.09744
Range0.04748
Interquartile range (IQR)0.00842

Descriptive statistics

Standard deviation0.007060362795
Coefficient of variation (CV)0.1124304382
Kurtosis3.00589212
Mean0.06279760984
Median Absolute Deviation (MAD)0.00422
Skewness1.304488813
Sum35.73184
Variance4.98487228e-05
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.0566730.5%
 
0.0611330.5%
 
0.0591330.5%
 
0.0678230.5%
 
0.0590730.5%
 
0.0571520.4%
 
0.0675820.4%
 
0.0633120.4%
 
0.0586620.4%
 
0.0628420.4%
 
0.0554420.4%
 
0.0591220.4%
 
0.0558120.4%
 
0.0604820.4%
 
0.055820.4%
 
0.0623520.4%
 
0.0595520.4%
 
0.0601920.4%
 
0.0562320.4%
 
0.0547420.4%
 
0.0619420.4%
 
0.0669720.4%
 
0.0597620.4%
 
0.0612120.4%
 
0.0630320.4%
 
Other values (474)51490.3%
 
ValueCountFrequency (%) 
0.0499610.2%
 
0.0502410.2%
 
0.0502510.2%
 
0.0504410.2%
 
0.0505410.2%
 
0.0509610.2%
 
0.0517610.2%
 
0.0517710.2%
 
0.0518510.2%
 
0.0522310.2%
 
ValueCountFrequency (%) 
0.0974410.2%
 
0.0957510.2%
 
0.0950210.2%
 
0.0929610.2%
 
0.089810.2%
 
0.0874310.2%
 
0.084510.2%
 
0.0826110.2%
 
0.0824310.2%
 
0.0814210.2%
 

radius_se
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count540
Unique (%)94.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.40517205623901575
Minimum0.1115
Maximum2.873
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.1115
5-th percentile0.1601
Q10.2324
median0.3242
Q30.4789
95-th percentile0.95952
Maximum2.873
Range2.7615
Interquartile range (IQR)0.2465

Descriptive statistics

Standard deviation0.277312733
Coefficient of variation (CV)0.6844320301
Kurtosis17.68672597
Mean0.4051720562
Median Absolute Deviation (MAD)0.106
Skewness3.088612166
Sum230.5429
Variance0.07690235188
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0.28630.5%
 
0.220430.5%
 
0.231520.4%
 
0.33820.4%
 
0.327620.4%
 
0.256220.4%
 
0.16320.4%
 
0.257720.4%
 
0.18420.4%
 
0.297620.4%
 
0.192420.4%
 
0.262120.4%
 
0.353420.4%
 
0.507920.4%
 
0.235120.4%
 
0.30620.4%
 
0.223920.4%
 
0.595920.4%
 
0.206720.4%
 
0.153220.4%
 
0.268420.4%
 
0.224420.4%
 
0.160120.4%
 
0.295720.4%
 
0.410120.4%
 
Other values (515)51790.9%
 
ValueCountFrequency (%) 
0.111510.2%
 
0.114410.2%
 
0.115310.2%
 
0.116610.2%
 
0.118610.2%
 
0.119410.2%
 
0.119910.2%
 
0.12110.2%
 
0.126710.2%
 
0.130210.2%
 
ValueCountFrequency (%) 
2.87310.2%
 
2.54710.2%
 
1.50910.2%
 
1.3710.2%
 
1.29610.2%
 
1.29210.2%
 
1.29110.2%
 
1.21510.2%
 
1.21410.2%
 
1.20710.2%
 

texture_se
Real number (ℝ≥0)

Distinct count519
Unique (%)91.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2168534270650264
Minimum0.3602
Maximum4.885
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.3602
5-th percentile0.54014
Q10.8339
median1.108
Q31.474
95-th percentile2.212
Maximum4.885
Range4.5248
Interquartile range (IQR)0.6401

Descriptive statistics

Standard deviation0.5516483926
Coefficient of variation (CV)0.4533400493
Kurtosis5.349168692
Mean1.216853427
Median Absolute Deviation (MAD)0.3153
Skewness1.646443809
Sum692.3896
Variance0.3043159491
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.3530.5%
 
1.26830.5%
 
0.856130.5%
 
1.1530.5%
 
1.42820.4%
 
0.942920.4%
 
1.16920.4%
 
1.3920.4%
 
1.16620.4%
 
1.19920.4%
 
1.03320.4%
 
1.02320.4%
 
1.30520.4%
 
1.15220.4%
 
1.56320.4%
 
1.36320.4%
 
1.02720.4%
 
0.733920.4%
 
1.09520.4%
 
0.822520.4%
 
1.34220.4%
 
1.21620.4%
 
1.04620.4%
 
1.05920.4%
 
1.04520.4%
 
Other values (494)51590.5%
 
ValueCountFrequency (%) 
0.360210.2%
 
0.362110.2%
 
0.362810.2%
 
0.387110.2%
 
0.398110.2%
 
0.406410.2%
 
0.412510.2%
 
0.433410.2%
 
0.433610.2%
 
0.440210.2%
 
ValueCountFrequency (%) 
4.88510.2%
 
3.89610.2%
 
3.64710.2%
 
3.56810.2%
 
3.1210.2%
 
2.92710.2%
 
2.9110.2%
 
2.90410.2%
 
2.87810.2%
 
2.83610.2%
 

perimeter_se
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count533
Unique (%)93.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.866059226713533
Minimum0.757
Maximum21.98
Zeros0
Zeros (%)0.0%
Memory size4.6 KiB

Quantile statistics

Minimum0.757
5-th percentile1.1328
Q11.606
median2.287
Q33.357
95-th percentile7.0416
Maximum21.98
Range21.223
Interquartile range (IQR)1.751

Descriptive statistics

Standard deviation2.021854554
Coefficient of variation (CV)0.7054475829
Kurtosis21.40190493
Mean2.866059227
Median Absolute Deviation (MAD)0.77
Skewness3.443615202
Sum1630.7877
Variance4.087895838
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1.77840.7%
 
3.56420.4%
 
1.14320.4%
 
1.53520.4%
 
2.40620.4%
 
1.95920.4%
 
1.56620.4%
 
1.95520.4%
 
1.10120.4%
 
2.74720.4%
 
2.22520.4%
 
2.76520.4%
 
2.87320.4%
 
2.4120.4%
 
1.24320.4%
 
3.76720.4%
 
1.99420.4%
 
2.36320.4%
 
1.49120.4%
 
2.18320.4%
 
1.66720.4%
 
2.2320.4%
 
5.80120.4%
 
2.04120.4%
 
1.44520.4%
 
Other values (508)51790.9%
 
ValueCountFrequency (%) 
0.75710.2%
 
0.771410.2%
 
0.843910.2%
 
0.848410.2%
 
0.87310.2%
 
0.921910.2%
 
0.96810.2%
 
0.981210.2%
 
0.985710.2%
 
0.988710.2%
 
ValueCountFrequency (%) 
21.9810.2%
 
18.6510.2%
 
11.0710.2%
 
10.1210.2%
 
10.0510.2%
 
9.80710.2%
 
9.63510.2%
 
9.42410.2%
 
8.86710.2%
 
8.8310.2%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

Sample

First rows

radius_meantexture_meanperimeter_meanarea_meansmoothness_meancompactness_meanconcavity_meanconcave points_meansymmetry_meanfractal_dimension_meanradius_setexture_seperimeter_se
017.9910.38122.801001.00.118400.277600.300100.147100.24190.078711.09500.90538.589
120.5717.77132.901326.00.084740.078640.086900.070170.18120.056670.54350.73393.398
219.6921.25130.001203.00.109600.159900.197400.127900.20690.059990.74560.78694.585
311.4220.3877.58386.10.142500.283900.241400.105200.25970.097440.49561.15603.445
420.2914.34135.101297.00.100300.132800.198000.104300.18090.058830.75720.78135.438
512.4515.7082.57477.10.127800.170000.157800.080890.20870.076130.33450.89022.217
618.2519.98119.601040.00.094630.109000.112700.074000.17940.057420.44670.77323.180
713.7120.8390.20577.90.118900.164500.093660.059850.21960.074510.58351.37703.856
813.0021.8287.50519.80.127300.193200.185900.093530.23500.073890.30631.00202.406
912.4624.0483.97475.90.118600.239600.227300.085430.20300.082430.29761.59902.039

Last rows

radius_meantexture_meanperimeter_meanarea_meansmoothness_meancompactness_meanconcavity_meanconcave points_meansymmetry_meanfractal_dimension_meanradius_setexture_seperimeter_se
55911.5123.9374.52403.50.092610.102100.111200.041050.13880.065700.23882.9041.936
56014.0527.1591.38600.40.099290.112600.044620.043040.15370.061710.36451.4922.888
56111.2029.3770.67386.00.074490.035580.000000.000000.10600.055020.31413.8962.041
56215.2230.62103.40716.90.104800.208700.255000.094290.21280.071520.26021.2052.362
56320.9225.09143.001347.00.109900.223600.317400.147400.21490.068790.96221.0268.758
56421.5622.39142.001479.00.111000.115900.243900.138900.17260.056231.17601.2567.673
56520.1328.25131.201261.00.097800.103400.144000.097910.17520.055330.76552.4635.203
56616.6028.08108.30858.10.084550.102300.092510.053020.15900.056480.45641.0753.425
56720.6029.33140.101265.00.117800.277000.351400.152000.23970.070160.72601.5955.772
5687.7624.5447.92181.00.052630.043620.000000.000000.15870.058840.38571.4282.548